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Abstract 

In this paper, we introduce the k x n (with k < n) truncated, supplemented Pascal matrix which 
has the property that any k columns form a linearly independent set. This property is also present 
in Reed-Solomon codes; however, Reed-Solomon codes are completely dense, whereas the truncated, 
supplemented Pascal matrix has multiple zeros. If the maximal-distance separable code conjecture is 
correct, then our matrix has the maximal number of columns (with the aformentioned property) that 
the conjecture allows. This matrix has applications in coding, network coding, and matroid theory. 


1 Introduction 

Finite field linear algebra is an import branch of linear algebra. Instead of using the infinite field R, it uses 
linearly independent vectors consisting of a finite number of elements, which can be represented by a finite 
number of bits. It has thus motivated many practical coding techniques, such as Reed-Solomon codes m 
and linear network coding [ail]. It is also closely related to structural matroid theory [13] through matroid 
representability naiTaisiiie]. 

One of the most important problems in finite field linear algebra is finding the size of the largest set of 
vectors over a fc-dimensional finite field such that every subset of k vectors is linearly independent mm- 
From a matrix perspective, the problem is described as: 

Problem 1.1. Consider a finite field Fg, where q = , for p a prime and h a nonnegative integer. Given 
a positive integer k, what is the largest integer n, such that there exists a k x n matrix H over Fg, in which 
every set of k columns is linearly independent? 

Such a matrix, upon its existence, could be the generator matrix of an [n, k] maximum-distance-separable 
(MDS) code [4], which can correct up to c? = n — fc bits of erasures or t = d/2 bits of errors. We will thus 
refer to H as an MDS matrix. Its existence also determines the representability of uniform matroids, which 
we will discuss in detail in Section 14.31 The maximal value of n, according to the MDS conjecture, is 9+1, 
unless 9 = 2^ and fc = 3 or fc = 9 — 1, in which case n < 9 + 2. This conjecture has been recently proved for 
any 9 = p by Ball mm- But a complete proof of it remains open. 

Therefore, it is crucial to understand the construction of fc x (9 + 1) MDS matrices. In coding theory 
literature, many construction algorithms have been proposed to meet certain coding requirements. However, 
their computational complexity is not necessarily satisfactory. On one hand, multiplications and additions 
over large finite field are required in the matrix construction. On the other hand, the resultant MDS matrix 
may have a low sparsity (or high density), which is measured by the number of zeros in the matrix. For 
example, Reed-Solomon codes have no zeros in its generator matrix. A low sparsity can be translated into 
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higher encoding and decoding complexity. It is an open question regarding how these algorithms can be 
generalized and provide new insights into related fields, such as network coding theory and matroid theory. 

In this paper, we investigate the above problems by first proposing in Section [ 5 ] a new type of MDS 
matrix called a supplemented Pascal matrix. A supplemented Pascal matrix can be generated by additions 
and, in particular, without multiplications. It also has guaranteed number of zero entries for high sparsity. 
We will prove that a supplemented Pascal matrix is an MDS matrix in Section [ 3 ] We will then extend our 
results into a general code construction framework in section lPTl and then discuss its applications to network 
coding theory and matroid theory in sections 14.21 and 14.31 respectively. 


2 Definitions 


For clarity we should first label the elements of a finite field. Henceforth, let p be a prime and h be a 
nonnegative integer. A finite field contains q = p^ elements, each represented by a polynomial g{x) = 
Pix'’, whose coefficients are £ [ 0 ,p — 1 ]. Substituting a; = p to a different g{x) will yield a 

different value between 0 and g — 1 , which is an intuitive index of the corresponding element. Specifically, 
we define a index function fjqin): 

Definition 2.1. For any integer n £ [ 0 ,g— I], crq{n) is the element of¥q whose polynomial coefficients 
satisfy YhZo PiP" = n. 

For example, given q = 2^, we have 0-^(0) = 0 , cjq(l)=l, and (Jqffi) = + 1. 

Based on aq{n), we define a finite field binomial polynomial fmffi)'. 


/m(n) 


1 = W,{n)r, 

r-rm crq(ra)-g-„(z-l) 

.Il*=i O’?!*) ' 


m = 0 
m > 0 


( 1 ) 


where {m,n} £ [ 0 ,q — 1 ] are non-negative integers. Intuitively, /m(n) is a polynomial of crg(n) of degree m. 
Based on /m(n), we introduce the key matrix in this paper, called the Pascal matrix: 


Definition 2.2. 


The upper-triangular Pascal matrix Pq over is a qxq matrix with its element Pq (to, n) = 


/o(0) 

/o(l) •• 

fo{q - 1) 

/i(0) 

/i(l) •• 


Vi(0) 


1 

• • 

1 

h-^ 


For brevity, we call the upper-triangular Pascal matrix the Pascal matrix. 


Note that the matrix index starts from 0 . For example, when g = 2 ^ = 4 , we have: 


Example 2.3. 



1 

1 

0 

0 


1 

X 

1 

0 


1 

a; -I -1 
a; -I -1 
1 


Our considered matrix is named after Pascal because its entries are binomial coefficients, which is the 
same as traditional Pascal matrix, except that the field applied is F^ and Z>o, respectively. Indeed, when 
q = p, Pp is equal to the traditional Pascal matrix modulo-p. For example, when q = p = 5: 


Example 2.4. 
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traditional 


'1 

1 

1 

1 
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'1 

1 

1 

1 
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1 

2 

3 

4 

0 

0 

1 

3 

6 

v.s. Ps = 

0 

0 

1 

3 

1 

0 

0 

0 

1 

4 


0 

0 

0 

1 

4 

0 

0 

0 

0 

1 


0 

0 

0 

0 

1 
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Indeed, the construction of the Pascal matrix under Fp shares the same additive formula as the traditional 
Pascal matrix. Explicitly, Pp(m, n) = Pp(TO — 1, m — 1) + Pp(m, n — 1) for every pair of {m, n} G [1, Q — 1] 
(note that addition is Mod-p). This idea appears in section 4.2. 


Definition 2.5. The truncated Pascal matrix Pq.fe is the Pascal matrix Pq truncated to the first k rows. 


Example 2.6. 


P 


5,2 — 


1 

0 


1 

1 


1 1 
2 3 


Definition 2.7. A supplemented Pascal matrix, denoted hy is a truncated Pascal matrix Pq^k appended 

with a column vector Sk, which has a one in the bottom entry and zeroes everywhere else: 


Example 2.8. 


-1 

1- 

o • • • 

0 

1- 
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5,2 — 
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0 


1 

1 


1 1 
2 3 


0 

1 


Our supplemented Pascal matrix has a desirable property, namely: 


Theorem 2.9. Any k columns ofUq^k are linearly independent. 


(3) 


3 Proof of Theorem 12.9 


We will first prove the following property of Pq,k, and then prove that tlq,k preserves this property. 
Lemma 3.1 (Truncation Lemma). Any k columns ofPq ^. are linearly independent. 

Proof. To prove it, we first note that Pq (and thus Pq.fc) has two important properties: (m begins at 0.) 

1 . All the entries in the m-th row are defined by the same polynomial fm(n), which has a degree of m; 

2. This polynomial has m roots, which are {o’g(n)}((TQ^. Consequently, the first m entries of the m-th 
row are all zeros. 


Given a truncated Pascal matrix Pq^k, our hypothesis (to be disproved) is that there exist k distinc¬ 
tive values of n, say {no,ni, ■ ■ ■ ,nk-i}, such that columns {no,ni, • • • ,nfe_i} of Pq^k constitute a linearly 
dependent set. In other words, if our hypothesis is valid, then there exists an k x k sub-matrix M of Pq,k- 


fo{no) fi{ni) ■■■ /i(nfc_i) 

hino) f2{ni) ■■■ /2(nfc_i) 

fk-i{no) /fc-i(ni) ••• fk-iiuk-i) 


(4) 


whose rank is smaller than k. 

If this is the case, then there must exist a length-A: non-zero vector a S such that a x M = z, where 
z is an all-zero vector of length k: 


[oq, 02, • • • , afe-ila X M = [0,0, • • • , 0]z 

Recall that the m-th row of (and thus M) is defined by fmfn). Correspondingly, z is defined by: 


fc-i 

fin) = ^ amfm{n), 

m—0 
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where z(fc) = f'{nk) = 0 for all m G [0, fc — 1]. We also note that the degree of f'{n) is at most fc — 1, because 
the highest degree of its summands is the degree of fk-i{n) with a value of fe — 1. 

Therefore, if our hypothesis is valid, i.e., if columns-{no, ni, • • • ,nk-i} of Uq^k constitute a linearly 
dependent set, then we will obtain a polynomial f'(n) such that: 


• Its degree is at most fe — 1; 


• It has fe roots, whose values are {aq{no),aq(ni), ■ ■ ■ ,aq{nk-i)}. 


However, with a degree of at most fc — 1, f'{n) can only have at most fc — I roots unless f'{n) = 0, 
which is not the case because a is non-zero. Hence, f'{n) does not exist, and thus our hypothesis is invalid. 
Therefore, every fe columns of Uq^k are linearly independent. □(lemma) 

Since H,, k is constructed by appending Sk to Pg,fc, to prove Theorem l2.9l we only need to prove that any 
fe — 1 columns of Pq^k and Sk together never constitute a linearly dependent set. To see this, we can simply 
use Sfc to linearly cancel the first q entries in the last row of Pi-q^k- This will transform from ([3]) into: 



0 ■ 

P q,k — l 

• o 

1 - 

O 

O 

1 


(5) 


which indicates that Sk is orthogonal to all the other columns of Then, by applying the truncation 

lemma to Pq^k-i, we know that every fe — I out of the first q columns of H-'qk are linearly independent. 
Adding Sk to them will yield a linearly independent set of fe. Theorem 12.91 is thus proved. □ 


4 Applications 

4.1 Coding Theory 

The truncation lemma can be immediately generalized to any appropriately defined fe x n matrix that satisfies: 
1) n < g, and 2) the m-th (m G [0,fc — 1]) row is defined by a polynomial of degree m. For example, by 
setting fm(n) = aq{n)'^~^, we can obtain a fe x n matrix under such that every set of fc columns is a 
linearly independent set. Indeed, this matrix is the generator matrix G of a (n, fc) Reed-Solomon code: 


r ^,(1)° 

^,(2)° • 

cr,(n)0 

o-g(l) 

^,(2) • 

aq{n) 



<7q{nf 





Then by appending Sk, we can obtain a fe] Reed-Solomon code. Therefore, our polynomial approach 

is a general approach of constructing non-trivial [n, fc] MDS codes. It also indicates that the maximum length 
of any MDS code is at least g -|- I for any k ^ q. This result well resonates the MDS conjecture mm- 

Among all the possible constructions, the supplemented Pascal matrix tiq,k enjoys a high sparsity, which 
is the number of zeros in the matrix. Higher sparsity is advantageous, because it generally leads to easier 
encoding/decoding. However, the sparsity has an upper bound. In the following lemma, we will prove that 
ilq,k approximates this bound with a factor of 

Lemma 4.1 (Matrix Sparsity). The number of zeros in the supplemented Pascal matrix tiq,k is ^ of the 
maximum sparsity of any (n, fc) code. 

Proof. Since any fc x fc sub-matrix of G has a rank of fc, there is no all-zero row in this matrix. Hence, there 
is at most fc — 1 zeros in each row of G, and at most + k zeros in total. Recall that in Hq^k the m-th row 
(m G [0, fc — I]) has m zeros. The total number of zeros is ^ , which is half of the maximum. □ 
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4.2 Network Coding Theory 

Network coding (NC) is a class of packet-based coding techniques. Consider a block oi K >\ data packets 
each containing L bits of information. NC treats these data packets as K variables, and sends in 
the u-th (u S [0, -foo]) transmission a linear combination y„ of all of them: 


K-l 

ytt = X! (6) 

fc =0 

where coefficients {o-k}k=o elements of a finite field F^. 

Ideally, NC is able to allow any receiver that has received any K coded packets to decode all the K data 
packets by solving a set of K linear equations. To this end, the associated coefficient matrix C, where 



ao,o 

ao,! 


c = 

ctip 

ai,i 



CUK-l.O 

aif-1,1 ■ ■ • 



must satisfy that every set of K columns of it is a linearly independent set. Once this condition is met, NC 
is able to achieve the optimal throughput in wireless broadcast scenarios |16| . 

However, it is highly non-trivial to meet this condition, which hinders the implementation of NC. First, to 
guarantee the linear independence, the sender either chooses coefficients randomly from a sufficiently large ¥q 
[IHIZ] or regularly collect receiver feedback to make online coding decisions [6] . While large incurs heavy 
computational loads, collecting feedback could be expensive or even impossible in certain circumstances, such 
as time-division-duplex satellite communications m- Second, to enable the decoding, coding coefficients 
must be attached to each coded packet, which constitute \Klog 2 q~\ bits of overhead in each transmission. 
When q is large and L is small, the throughput loss due to the overhead may overwhelm all the other benefits 
of NC. 

These practical shortages of NC can be easily overcome by the proposed supplemented Pascal matrix. 
By choosing a sufficiently large p and let C = we obtain an NC that is both computational friendly 

(only Mod-p operations) and feedback-free. Moreover, for the receivers to retrieve the coding coefficients, 
the sender only needs to attach the index u to the w-th packet, rather than attaching the complete coeffi¬ 
cients. Furthermore, the additive formula for Pascal matrix may enable efficient progressive coding/decoding 
algorithms, which could be our future research direction. 

4.3 Matroid Theory 

A matroid M = (A, X) is a finite collection of elements called the ground set, E, paired with its comprehensive 
set of independent subsets, X. A uniform matroid has \E\ = n and the property that any size k subset 
of E is an element of X and no size (fc -I- 1) subset is in X. is called g-representable if there is a fc x n 
matrix such that every k columns of it are linearly independent under F^. 

Corollary 4.2 (Representability of Uniform Matroid). Any uniform matroid that satisfies n ^ q+1 is 
q-representable by any n columns oftlq^k- D 

The statement: Any uniform matroid that satisfies n ^ q + 1 is q-representable is known [MlISlIIl]; 
one can obtain another construction from Reed-Solomon codes. is just another, sparse example. 


5 Conclusion 

In this paper, we proposed the supplemented Pascal matrix, whose first k rows is an MDS matrix under 
Fq for any prime power q and positive integer k ^ q. Our construction can be potentially generalized 
to a framework that enables low-complexity MDS code constructions and encoding/decoding as well. Our 
matrix can overcome some practical shortages of network coding and, thus, enables high-performance wireless 
network coded packet broadcast. Our matrix resonates with existing results on the representability of uniform 
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matroids, while also providing new insights into this topic. In the future, we intend to study Pascal-based 
network coding algorithms. We are also interested in applying our results to other fields such as projective 
geometry and graph theory. 
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